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Songs constitute a natural combination of lyrics and melodies, but it is unclear whether and 
how these two song components are integrated during the emergence of a memory trace. 
Network theories of memory suggest a prominent role of the hippocampus, together with 
unimodal sensory areas, in the build-up of conjunctive representations. The present study 
tested the modulatory influence of the hippocampus on neural adaptation to songs in lateral 
temporal areas. Patients with unilateral hippocampal sclerosis and healthy matched con- 
trols were presented with blocks of short songs in which lyrics and/or melodies were varied 
or repeated in a crossed factorial design. Neural adaptation effects were taken as correlates 
of incidental emergent memory traces. We hypothesized that hippocampal lesions, partic- 
ularly in the left hemisphere, would weaken adaptation effects, especially the integration 
of lyrics and melodies. Results revealed that lateral temporal lobe regions showed weaker 
adaptation to repeated lyrics as well as a reduced interaction of the adaptation effects 
for lyrics and melodies in patients with left hippocampal sclerosis. This suggests a defi- 
cient build-up of a sensory memory trace for lyrics and a reduced integration of lyrics with 
melodies, compared to healthy controls. Patients with right hippocampal sclerosis showed 
a similar profile of results although the effects did not reach significance in this population. 
We highlight the finding that the integrated representation of lyrics and melodies typically 
shown in healthy participants is likely tied to the integrity of the left medial temporal lobe. 
This novel finding provides the first neuroimaging evidence for the role of the hippocampus 
during repetitive exposure to lyrics and melodies and their integration into a song. 
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INTRODUCTION 

As humans, we learn and enjoy songs from a very early age on. 
Over the course of our lives, we hear and remember thousands 
of songs and, most of the time, we learn them implicitly with- 
out much effort especially after repeated presentations (as with 
hit songs on the radio). Songs naturally combine music and lan- 
guage into a unique acoustic signal. However, it remains unclear 
whether memory traces of lyrics and melodies are built separately 
or in integration. Indeed, evidence from healthy participants and 
brain-damaged patients diverge on this question. On the one hand, 
several behavioral studies in healthy participants support the tight 
association of lyrics and melodies during the creation of a song 
memory trace as shown by cueing effects of one element on the 
other during song recognition (Serafine et al, 1984, 1986; Crow- 
der et al, 1990; Baur et al, 2000; Peretz et al., 2004; Peynircioglu 
et al, 2008; Johnson and Halpern, 2012). On the other hand, neu- 
ropsychological studies in patients with lesions in the medial or 
lateral temporal lobes reveal dissociated recognition impairments 



for verbal and musical features of songs (Samson and Zatorre, 
1991; Hebert and Peretz, 2001). These results suggest that the nat- 
ural binding of lyrics and melodies into one unique song memory 
trace may be disrupted after brain damage. The present study 
seeks to find neural evidence for this hypothesis by investigating 
the effect of hippocampal damage on the emergence of integrated 
memory traces for lyrics and melodies during repeated exposure 
to songs. 

Research over the last two decades testifies to a growing aware- 
ness that the hippocampus - beyond its classical role in explicit 
episodic memory (Scoville and Milner, 1957; Mishkin, 1982; Zola- 
Morgan and Squire, 1993) - plays a role in the implicit build-up 
of a memory trace (Chun and Phelps, 1999; Graham et al., 2010) 
and the bridging between perception and encoding (Bussey and 
Saksida, 2005; Baxter, 2009; Suzuki, 2009; Suzuki and Baxter, 2009; 
Olsen et al, 2012). According to the Emergent Memory Account 
(Graham et al, 2010) advancing a non-modular view of mem- 
ory and perception, memory arises from a dynamic interaction 
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between the perceptual representations distributed across the 

whole brain and a key role of the medial temporal lobe. More 
specifically, the hippocampus is thought to form conjunctive rep- 
resentations of inputs from unimodal and polymodal sensory cor- 
tices and to continuously return the processed information to the 
sensory cortex via feedback connections (McClelland et al., 1995; 
Eichenbaum, 2000; Turk-Browne et al, 2006; Bast, 2007), thus con- 
stantly updating the current representations with new experiences. 
This cortico-hippocampal loop of flowing information guaran- 
tees the encoding of events and its storage (Eichenbaum, 2000). 
Note that this mechanism not only implies a shared, anatomically 
distributed cerebral network for both memory and perception, 
but also puts the medial temporal lobe into a cardinal position 
between perceptual processes (Lee et al, 2005; Lee, 2006; Lee and 
Rudebeck, 2010a) and memory (long-term as well as short-term 
and working memory: Zarahn, 2004; Axmacher et a]., 2007; Lee 
and Rudebeck, 2010b; Rose et al., 2012). Crucially, the hippocam- 
pus' combined role in (i) memory formation and (ii) conjunction 
of sensory inputs (Sutherland and Rudy, 1989; Eichenbaum et al., 
1994; Rudy and Sutherland, 1995; O'Reilly and Rudy, 2001; Win- 
ters, 2004; Cowell ct al., 2006, 2010; Barense ct al, 2007; Diana 
et al, 2007) makes it a potential key candidate for (i) the build- 
up of song memory traces, in which (ii) lyrics and melodies are 
integrated. 

Although most of the studies on the hippocampus' role in mem- 
ory formation and binding come from the visual domain (Davachi, 
2006; Diana et al., 2007; Shimamura, 2010), we hypothesize that 
similar processes also apply to the auditory domain (Overath et al., 
2007, 2008; Buchsbaum and D'Esposito, 2009), especially to songs. 
It is reasonable to assume that memory formation for lyrics and 
melodies happens through a cortico-hippocampal loop, and that 
the natural combination of a verbal and a melodic component into 
a single song percept and memory trace requires binding mecha- 
nisms as described above. Tentative support for this comes from 
lesion studies in patients with anterior temporal lobectomy for 
treatment of pharmaco-resistant epilepsy (Samson and Zatorre, 
1991). Using explicit recognition memory tasks after presentation 
of short unfamiliar songs, these experiments revealed a clear deficit 
in recognition of sung and spoken lyrics after left temporal lobe 
resection, and impaired recognition of melodies (without text) 
after right temporal lobe resection. On top of that, the data sug- 
gest a lack of integration of lyrics and melodies in patients with 
unilateral left (but not those with right) temporal lobe lesions. 
While patients with right temporal lobe resections had deficits in 
melody recognition when the tune was sung with new words, i.e., 
showing that they had bound the melody to the original lyrics, 
no such conjunction was observed in left-hemisphere damaged 
patients. In fact, their recognition of lyrics was impaired irrespec- 
tive of whether these were presented with (or without) old or new 
melodies, suggesting an independent processing of the two song 
components and an isolated deficit for lyrics. 

While these results lend initial support for our hypothesis of 
hippocampal involvement in song memory formation, they leave 
two important questions open: first, in how far can these deficit 
patterns be attributed to hippocampal dysfunctions, and second, in 
how far may these results depend on the use of a recognition mem- 
ory task? First, the resection always included anterior temporal 



lobe structures beyond the hippocampus, making it difficult to 

pinpoint a specific hippocampal role. Furthermore, although the 
lesion description was based upon the surgeon's meticulous draw- 
ings, a precise assessment of how far the resection extended into the 
hippocampus was not possible at that time. Moreover, although 
recognition tasks certainly depend on successful encoding, they 
also involve aspects of memory retrieval making it difficult to dis- 
entangle these effects with behavioral data. The present study seeks 
to address the points by first, testing patients with circumscribed 
unilateral hippocampal sclerosis (i.e., prior to surgery without fur- 
ther macroscopic lesions) and precisely describing the extent of 
hippocampal damage by means of volumetric analyses. Second, 
the incidental build-up of a song memory trace was assessed unbe- 
knownst to the participants by examining the dynamics of neural 
adaptation during natural passive listening as described below. 

Numerous studies have investigated the neural correlates of 
song processing (Samson and Zatorre, 1991; Brown et al., 2004a,b; 
Schon et al, 2005; Callan et al, 2006; Suarez et al, 2010; Mer- 
rill et aL, 2012; Saito et al., 2012; Tierney et al., 2012), however, 
rarely has any study touched upon the implicit emergence of song 
memory. Indirect evidence can be drawn from studies using the 
successive presentation of changed and unchanged song stimuli 
(Same vs. Different) (Schon et al, 2010) and neural adaptation 
paradigms (Sammlcr et al., 2010). Adaptation is "a reduction of 
neural activity following prolonged or repetitive exposure to iden- 
tical or at least similar stimuli" (Dobbins et al., 2004; Ganel et al., 
2006; Grill-Spector et al, 2006), similar to repetition priming 
(Old vs. New stimuli) (Krekelberg et al., 2006). Although typically 
described in studies on perception, it appears that neural adap- 
tation may also be indicative of memory trace formation. In line 
with the Emergent Memory Account (Graham et al., 2010), neural 
adaptation may reflect the emergence of a memory trace within 
cortical areas of perceptual representation through implicit learn- 
ing during repeated exposure. Given the role of the hippocampus 
in memory formation (Turk-Bro^vne et al, 2006) and according 
to connectionist models of memory (Damasio, 1989; McClelland 
et al., 1995; RoUs, 1996; Fuster, 1997), it is reasonable to suggest 
that cortical adaptation effects are subject to top-down modula- 
tions driven by the hippocampus (Blondin and Lepage, 2005; Gob 
ct al, 2007), including integration of lyrics and melodies through 
binding (for a review on binding, see Opitz, 2010). 

Of particular relevance for our research question of how lyrics 
and melody are bound in a conjunctive song memory trace are 
those studies describing the cerebral substrates underlying the 
integration of verbal and melodic components of songs (Samm- 
ler et al., 2010; Schon et al., 2010). These studies, which consider 
songs to be more than the sum of lyrics and melodies, examined 
modulations of brain activity to investigate how the two compo- 
nents interact, and how their processing is lateralized. For instance, 
Schon et al. (2010, Exp. 2) presented pairs of sung words that 
could vary or repeat in terms of the verbal and/or the melody 
component in a same-different task. Their results showed inter- 
active processing in the left and the right superior temporal gyrus 
(STG), suggesting an integrated processing of the two components 
in these areas. Sammler et al. (2010) adopted a similar approach, 
taking advantage of neural adaptation effects. In this study, healthy 
participants were presented with blocks of short songs in which 
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Table 1 | Demographic data. 

Group N Sex (males/females) Mean age Mean education Musical expertise Full scale IQWAIS-R^ RAVLT forgetting%' 



Control 19 9/10 32.63±7.90 14.53±2.99 5.02±3.68 

LTLE 12 5/7 34.17±8.71 12.25±1.42 6.75±4.46 92.66±1.33 20.56±22.99 

RTLE 12 5/7 39.92±1.23 12.67±2.71 6.83±5.45 95.43±1.79 9.07±10.76 



'Mean for all except two RTLE patients due two missing data. RAVLT: Rey Auditory Verbal Learning Test. 



repetition of lyrics and/or melodies was varied in a factorial design 
to induce selective adaptation to lyrics, melodies, or unified songs. 
Consistent with Schon et al. (2010), repeated lyrics or repeated 
tunes evoked adaptation effects in bilateral STG. Core areas of 
integration were found in the left middle superior temporal sulcus 
(STS) and the left premotor cortex (PMC) . Based on the previously 
reported literature, we hypothesize that these adaptation effects 
and the integration of lyrics and melodies are likely mediated by 
the hippocampus through feedback connections to STG/STS and 
binding of verbal and melodic information. 

To investigate the modulatory effect of the hippocampus on 
(i) the incidental emergence of a song memory trace and (ii) the 
integration of the verbal and melodic components of songs, we 
adopted the paradigm by Sammler et al. (2010) to test patients 
with unilateral left or right hippocampal sclerosis and healthy 
controls. We compared the patterns of adaptation produced by 
songs in which either the lyrics, or the melodies, or both were 
repeated. As demonstrated by diffusion-weighted imaging stud- 
ies, patients with hippocampal sclerosis present disconnections 
between medial and lateral temporal lobe regions (Focke et al., 
2008; Bettus et al, 2009; Diehl et al, 2010; Riley et al, 2010; Liao 
et al, 2011). Such lesions have the capacity to prevent the hip- 
pocampus from sending feedback predictions and from updating 
the sensory memory trace (as expected by default after repetitions) 
and thus weaken adaptation effects in general and integration of 
lyrics and melodies in particular. More precisely, following Sam- 
son and Zatorre (1991), we hypothesized reduced adaptation for 
lyrics after left and for melodies after right hippocampal sclerosis. 
Moreover, following previous studies showing binding deficits in 
patients with left anterior temporal lobe resections (Samson and 
Zatorre, 199 1 ) and correlates of lyrics-melody integration mainly 
in the left hemisphere (Sammler et al., 2010), we hypothesized 
that left hippocampal lesions, in particular, would have a negative 
impact on integration of lyrics and melodies in songs. 

MATERIALS AND METHODS 
PARTICIPANTS 

Twenty-four temporal lobe epilepsy patients with left (w=12; 
LTLE) or right (n = 12; RTLE) hippocampal sclerosis participated 
in this study. They all presented with medically intractable epilepsy 
and were seen during pre-surgical evaluation at Pitie-Salpetriere 
Hospital (Paris, France). All patients were right-handed according 
to the Edinburgh Handedness Inventory (Oldfield, 1971), except 
for one LTLE (-83.33) and one RTLE patient (-75). All patients 
had language lateralization to the left hemisphere except for the 
left-handed RTLE patient with bilateral language representation. 
Language lateralization was assessed by means of a verbal fluency 
test that is part of the standard functional magnetic resonance 



imaging (fMRI) assessment prior to epilepsy surgery at the Pitie- 
Salpetriere Hospital. In the scanner, patients are required to think 
as many words of a semantic category (e.g., tools) as possible. The 
number of activated left and right fronto-temporo-parietal voxels 
against baseline was used to calculate a standard language later- 
alization score (Lehericy et al., 2000; Thivard et al., 2005). The 
control group consisted of 19 right-handed healthy participants 
including 12 subjects, who had already participated in a previous 
study (Sammler et al, 20 1 0), and 7 new volunteers. All participants 
were French native speakers and reported to have normal hearing. 
Controls were carefully selected to match the patient groups in 
terms of age, mean years of education, and musical expertise (Ehrle 
musical expertise questionnaire, unpublished). A verbal memory 
deficit was present in the LTLE as opposed to the RTLE patients, as 
assessed with the Rey Auditory Verbal Learning Test (RAVLT) (Rey, 
1964; Sziklas and Jones-Gotman, 2008) in accordance with the 
usual neuropsychological profile of these patients. Demographic 
characteristics of the participants are summarized in Table 1. The 
sclerosis in either left or right hippocampus in the two patient 
groups was corroborated by a volumetric analysis using Freesurfer 
software (Fischl, 2012; Renter et al., 2012) that attested an ipsilat- 
eral hippocampal volume reduction of an average of 24.5 1% in the 
LTLE and 29.71% in the RTLE group compared to healthy con- 
trols. Between-group comparisons confirmed the significance of 
these volume reductions in the atrophic hippocampus (p < 0.05). 
Volumes and percentage of reduction are summarized in Table 2 
(for details on the volumetric analysis, see Data Analysis). The 
local ethics committee approved this study and informed consent 
was obtained from each participant. 

MATERIALS 

The material and the scanning protocol used here were previously 
published by Sammler et al. (2010). The stimulus set consisted 
of 48 blocks of 6 unfamiliar songs based on a collection of nine- 
teenth century French folk songs (Robine, 1994). Each song within 
a block was sung by a different singer to avoid adaptation to the 
singer's voice (Belin and Zatorre, 2003), had a duration of 2.5 s and 
was followed by a 0.2 s pause. Repetition of lyrics and/or melodies 
within blocks was crossed in a 2 x 2 factorial design, forming four 
conditions. Songs within a block either had the same melodies 
and same lyrics (SmSl), the same melodies but different lyrics 
(SmDl), different melodies with same lyrics(DMSL), or different 
melodies and different lyrics (DmDl). Mode and tempo were bal- 
anced across the stimulus set, and each song had an average of 7.65 
notes and 5.61 words. Songs in the four conditions did not differ 
with respect to length and number of word/note, word frequency, 
interval size, and number of contour reversals. In blocks where 
lyrics were varied, they did not rhyme, were semantically distant. 
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Table 2 | Medial temporal lobe (MTL) volumes (mm^). 

Left MTL Right MTL 



LTLE RILE LTLE RILE 



Region Mean/±SD Reduction Mean/±SD Reduction Mean/±SD Reduction Mean/±SD Reduction 

(%)» (%)» (%)» (%)» 



Hippocampus 2606.17/506.82 24.51 3467.95/245.68 -0.46 3589.04/678.97 -2.72 2455.90/414.39 29.71 

Entorliinal cortex 1802.67/613.03 3.86 184725/218.17 1.48 1901.42/408.28 3.73 1802/246.24 8.76 

Paraliippocampal gyrus 2165.42/386.72 8.56 2304.17/222.16 2.70 2249.92/266.37 3.26 2026/268.47 12.89 



"Percentage of reduced volumes as compared to control group volumes. 

and differed with respect to syntactic structure avoiding potential 
adaptation to phonology, semantic content, or syntactic structure 
(Noppeney and Price, 2004). 

PROCEDURE 

Participants were instructed to listen attentively vnth closed eyes 
while avoiding moving, humming, or singing along. No behav- 
ioral data were collected. Stimuli were presented using E-Prime 1 . 1 
(Psychology Software Tools) and delivered binauraUy through air 
pressure headphones (MR confon). Additionally, participants used 
earplugs to minimize noise interference. All blocks were presented 
in one of four pseudorandom orders, with a silent gap between 
blocks of 10 s (±0.5 s) allowing the hemodynamic response to 
return to baseline (Belin and Zatorre, 2003). This resulted in a 
total duration of the experiment of around 30 min. Blocks of the 
same condition were not presented more than twice in a row. At the 
end of the experiment, all participants filled in a debriefing ques- 
tionnaire with several nine-point scales (1 = not at all, 9 = always) 
in which they rated their attention during listening at 7.63 (Con- 
trols), 7.00 (LTLE), 7.57 (RTLE), and the amount of overt and/or 
covert singing during scanning at 0.00 and 2.89 (Controls), 0.47 
and 2.71 (LTLE), and 0.21 and 2.14 (RTLE), showing that they had 
followed the instructions. 

SCANNING 

Functional magnetic resonance imaging was performed using 
a 3-T Siemens TRIO scanner (Siemens, Erlangen, Germany) 
at the Centre de Neuroimagerie de Recherche at the Institut du 
Cerveau et de la Moelle Epiniere - ICM (Croupe Hospitaller 
Pitie-Salpetriere, Paris, France) . Radiofrequency transmission was 
performed with a body coil and the signal was received with a 
12-channel head coil. Before the functional scans, high-resolution 
Tl -weighted images (1x1x1 mm^ voxel size) were collected for 
anatomical coregistration using a magnetization-prepared rapid 
acquisition gradient-echo (MPRAGE) sequence (TR = 2300 ms, 
TE = 4.18 ms). Subsequently, one series of 595 blood oxy- 
genation level-dependent (BOLD) images was obtained using 
a single-shot echo-planar gradient-echo (EPI) pulse sequence 
(TR = 2120 ms, TE = 25 ms, the first six volumes were later dis- 
carded to allow for Tl saturation). Forty- four interleaved slices 
(3 mm X 3 mm x 3 mm voxel size, 10% interslice gap) perpendic- 
ular with respect to the hippocampal plane were collected. The 
field of view was 192 x 192 mm^ with an in-plane resolution of 



64 X 64 pixels and a flip angle of 90°. Scanner noise was con- 
tinuous during the experiment representing a constant auditory 
background. 

DATA ANALYSIS 

The fMRI data were analyzed using SPM8 (Wellcome Trust Cen- 
tre for Neuroimaging). Preprocessing included spatial realignment 
and reslicing and coregistration of the anatomical Tl to the mean 
functional data. The first level analysis was carried out in the native 
space. Four regressors were built for each experimental condition 
based on the general hnear model (different melodies and dif- 
ferent lyrics (DmDl); same melodies and different lyrics (SmDl); 
different melodies and same lyrics (DmSl) and same melodies and 
same lyrics (SmSl), and convolved with a hemodynamic response 
function (HRF). Movement parameters were included as regres- 
sors of no interest and serial correlations were modeled with an 
AR (1) process. A temporal high-pass filter with a cut-off of 200 s 
was used to eliminate low-frequency drifts. Six one-sample f-tests 
were computed for each participant: all conditions against silence 
to establish a "song-sensitive" mask, the main effects of adaptation 
to lyrics [(DmDl 4- SmDl) - (DmSl + SmSl)] and to melodies 
[(DmDl + DmSl) - (SmDl -|- SmSl)] to identify areas of general 
adaptation to the repetition of song components, as well as the 
interaction [(DmSl -l- SmDl) - (DmDl 4- SmSl)] to isolate areas 
of lyrics-melody integration. For the sake of completeness and 
consistency with the analysis of Saninilcr ct al, (2010), we addi- 
tionally compared both main effects to identify brain regions that 
showed an independent processing of either lyrics or melodies (i.e., 
stronger adaptation for lyrics than for melodies [2 x (SmDl)] and 
vice versa [2 x (DmSl)]). 

Segmentation of the anatomical files was performed with the 
VBM8 toolbox ( ^ ■ l^.biirncr and Friston, 2005) to form a normal- 
ized anatomical image and the DARTEL exported tissue types. A 
template with eight iterations was created in DARTEL (Ashburner, 
2007) including all 43 subjects to improve anatomical accuracy in 
the normalization of the functional contrast images obtained in 
the first level. Contrast images were spatially smoothed using a 
three-dimensional Gaussian kernel with 8 mm fuU width at half 
maximum. For the second level, the DARTEL normalized contrast 
images were normalized to the Montreal Neurological Institute 
(MNI) space. The automatically generated mask from the first 
level analysis of each subject was also normalized with this pro- 
cedure but without smoothing. Statistical analysis was confined 
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to a song-sensitive mask in gray matter to increase signal detec- 
tion (Friston et al., 1994). To create this mask, a binary mask from 
the last iteration of the DARTEL template thresholded at 0.3 was 
overlaid with active voxels in the "all conditions against silence" 
contrast at p < 0.05 (FWE correction for multiple comparisons), 
/c> 5 for all 43 participants. All voxels that were involved in both 
were included into the explicit song-sensitive mask for statistics. 
This mask covered an auditory-motor network, including the tem- 
poral gyrus, the PMC, and the cerebellum. For random effects 
group analyses, the individual contrast images were submitted to 
one-sample f -tests, separately for healthy controls, LTLE and RTLE 
patients. Furthermore, two-sample f-tests were computed for all 
contrasts, comparing each patient group against controls. All SPMs 
were threshold atp < 0.001 (uncorrected) with a minimum cluster 
extent of > 5 voxels. Results will report the peak voxel p value 
and the number of voxels (k). 

To assess the size of the hippocampal sclerosis and surround- 
ing cortex, volumetric measures of hippocampal, entorhinal, and 
parahippocampal gyrus were obtained for all participants with 
the Freesurfer image analysis suite (Fischl, 2012; Renter et al., 
2012), which is documented and freely available for download- 
ing online (http://surfer.nmr.mgh.harvard.edu/). Non-parametric 
tests (Kruskal-Wallis, SPSS 18.0) were used to compare these mea- 
sures between the patient and controls groups. To control global 
differences, intracranial volume was included in the analysis as a 
covariate, which was not found to be significant. The percentage of 
reduction of each structure was calculated for each patient group 
in comparison to the control group and is reported in Table 2. 

RESULTS 
MAIN EFFECTS 

A complete report of the results at threshold p < 0.001 (uncor- 
rected) with a minimum cluster extent of /c > 5 voxels can be seen 
in Table 3. All three groups of participants showed adaptation to 
lyrics in the left and right STG and STS that was however consid- 
erably more extended in Controls (2474 and 2423 voxels) than in 
LTLE (541 and388 voxels) and RTLE patients (201 and 165 voxels). 
Between-group comparisons revealed significantly weaker adap- 
tation effects in the LTLE but not in the RTLE as compared to 
Controls in the left STS (Figure lA). 

In all three groups, adaptation to melody was found in the left 
and right STG and STS, again more extended in Controls (2380 
and 1830 voxels) than in LTLE (245 and 295 voxels) and RTLE 
patients (106 and 111 voxels), as well as in the cerebellum. The 
Control group showed, in addition, adaptation in the left PMC (52 
voxels) that was not observed in patients (Figure IB). However, 
between-group differences failed to reach significance. 

INTERACTION EFFECTS 

Interaction effects were calculated with the contrast [(DmSl-|- 
SmDl) - (DmDl + SmSl)] and were taken to represent an inte- 
grated processing of lyrics and melodies in songs. Only the con- 
trol group showed interaction effects at p < 0.001 k>5, which 
were located in the bilateral posterior STG/STS (left: 169 vox- 
els and right: 323 voxels). No such effect was observed in LTLE 
and RTLE patients. To visualize areas that simply may not have 
passed our statistical criterion, we inspected the data at a very 




FIGURE 1 I Main effects of Adaptation to Lyrics (A) and Melody (B), and 
the Interaction (integration contrast) (C). Threshold p < 0.001 k>5 
uncorrected. Results for Control group (red), LTLE (blue), RTLE (green), and 
Controls vs. LTLE (yellow). 



lenient level of p < 0.05 uncorrected (fc>5). Controls showed 
an extended region within the left (1936 voxels) and right (2176 
voxels) STG/STS (Figure 2A). At this threshold, RTLE patients 
showed a pattern that was similar to Controls, but consider- 
ably less extended (554 and 1501 voxels) (Figure 2B). Inter- 
estingly, LTLE patients showed nearly no interaction in the 
temporal lobe at this very lenient threshold (238 and 35 vox- 
els) (Figure 2C). Indeed, between-group comparisons revealed 
a significantly weaker interaction effect in the LTLE than the 
Control group in the right STG (Figure IC) whereas the 
difference between the RTLE patients and Controls did not 
reach significance. Details on interaction effects are shown in 
Table 4. 

INDEPENDENCE EFFECTS 

Greater adaptation to lyrics as compared to melody was found 
bilaterally in the anterior region of the STG (23 and 196 vox- 
els) in the control group, suggesting an independent processing of 
lyrics in this region. Greater adaptation to melody as compared to 
lyrics was obtained bilaterally in the cerebellum in RTLE patients. 
LLowever, between-group differences failed to reach significance 
(Figure 2A). Details on independence effects are shown in Table 4. 
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FIGURE 2 I "Gradient of integration" for (A) the Control group (B) RTLE 
and (C) LTLE patients. Specificity for iyrics is shown in red (p < 0.001 k>5 
uncorr.), interaction in dark blue (p < 0.001 k>5 uncorr.) and weaker 
interaction in cyan (Interaction at p < 0.05 k>5 uncorr.). 



DISCUSSION 

The aim of the current study was to assess the modulatory effects 
of a unilateral hippocampal lesion on the incidental emergence of 
a song memory trace and the integration of lyrics and melodies 
into a conjunctive representation. To this end, neural adaptation 
to song repetition - as a proxy for song memory formation - was 
examined in patients with left or right hippocampal sclerosis in 
comparison to healthy controls using an fMR-adaptation para- 
digm. It was hypothesized that damage to the hippocampus may 
disrupt feedback connections to the lateral temporal lobe and 
thus preclude the establishment and update of a sensory memory 
trace. As a consequence, damage to the hippocampus may result in 
weaker neural adaptation in the STG. In particular, hippocampal 
lesions could hinder the integration of lyrics and melodies into a 
unified memory trace (Diana et al., 2007; Staresina and Davachi, 
2009; Graham et al, 2010; Shimamura, 2010). 

The main findings of this study were indeed that the neural 
adaptation to lyrics repetition as well as the integration of lyrics 
and melodies in songs (as reflected by the statistical interaction 
between adaptation effects for lyrics and melodies) was reduced 
in patients with left hippocampal sclerosis. More specifically, the 



direct comparison of these patients with healthy control partic- 
ipants revealed a weaker adaptation to lyrics in the left STS and 
a weaker integration of lyrics and melodies in the right STG. If 
one accepts the notion that neural adaptation reflects the emer- 
gence of a memory trace (see Introduction), these results are in 
line with our hypotheses and previous work showing that left 
hippocampal damage may lead to weaker memory for lyrics (Sam- 
son and Zatorre, 1991) and may hinder the integration of lyrics 
and melodies into a unified memory representation (Samson and 
Zatorre, 1991; Sammler et al, 2010). 

All three groups of participants showed adaptation to the rep- 
etition of lyrics or melodies in the bilateral STG and STS, but in 
both patient groups, these effects were markedly smaller in spa- 
tial extent when compared to healthy controls. Notably, patients 
with left (but not right) hippocampal sclerosis exhibited signif- 
icantly decreased adaptation to lyrics in the left STS, which is 
known to play a role in phonemic processing and also known 
to be crucial for the perception of a sound as speech (Dehaene- 
Lambertz et al, 2005; Liebenthal, 2005; Mottonen et al, 2006; 
for a review on STS, see Hein and Knight, 2008). This finding is 
most likely tied to the role of the left medial temporal lobe in 
verbal processing (Meyer et al., 2005; Wagner et al., 2008; Greve 
et al., 2011) and may reflect the perturbed build-up of memory 
traces for lyrics (and verbal material in general) due to disrupted 
feedback connections between medial and lateral structures of the 
left temporal lobe (Eichenbaum, 2000). Such an interpretation 
could be supported by the verbal memory deficit documented in 
the LTLE patients of the present study (assessed with the RAVLT) 
and, although we did not collect behavioral data for this exper- 
iment, these results are also in agreement with the behavioral 
results of Samson and Zatorre (1991). That study showed that 
the recognition of sung lyrics after listening to unfamiliar songs 
was impaired in patients with left (but not right) medial temporal 
lobe lesions. 

Although patients with right hippocampal sclerosis showed 
nominally reduced adaptation and integration effects, these did 
not significantly differ from those in healthy controls, suggest- 
ing rather normal song processing and lyrics-melody integration 
in these patients. While the latter is in line with previous behav- 
ioral data showing spared integration of lyrics and tunes after 
right anterior temporal lobe resection (Samson and Zatorre, 1 99 1 ), 
our hypothesis on reduced adaptation to melodies was not con- 
firmed. This may partly be due to the stimulus material used: even 
if melodies were repeated to induce adaptation, they differed in 
octave sung by sopranos, tenors, altos, and bass. Most likely, adap- 
tation effects are not fully robust to transposition of melodies. 
Furthermore, adaptation to melodies was generally weaker than 
adaptation to lyrics, as attested by the results in healthy partic- 
ipants, possibly resulting in a floor effect. Our participants may 
have paid less attention to melodies than to lyrics (as the lat- 
ter convey the message) leading to weak adaptation, given that a 
lack of attention reduces adaptation effects (Chee and Tan, 2007). 
Alternatively, several lines of evidence suggest that melodies may 
be processed more bilaterally than lyrics (Samson and Zatorre, 
1992; Binder et al, 2000; Besson and Schon, 2003; Peretz and 
Coltheart, 2003; Schon et al., 2005; Patel, 2008; Koelsch, 2012), 
leading to less severe deficits in processing melodies than in verbal 
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Group Adaptation for lyrics Adaptation for melody 



Table 3 | Main effects of lyrics and melodies repetition for each group and comparison between Controls and LTLE. 
Group Adaptation for lyrics Adaptation for melody 



Control 



LTLE 



RTLE 



Area 



Size(J^) 



X, y, z (mm) 



Left hemisphere 
STG/STS 



Temporal pole 

Right hemisphere 
STG/STS 



Cerebellum 

Left hemisphere 
STG/STS 



Right hemisphere 
STG/STS 



Left hemisphere 
STG/STS 



Right hemisphere 
Temporal pole 
STG/STS 



2474 



2423 



10 



541 



388 



201 



165 



-58, -6, -6 
-62, -17 0 
-48, -39, 6 
-51, 9, -18 



62, -9, -6 
60, -0,-13 
62, -24, 2 
16, -80, -46 



-58, -5, -9 
-56, -15, -3 
-66, -33, 9 

62, -9, -7 
62, 2, -13 



-63, -6, -6 



62, 3, -10 

63, -9,-10 



5.75 
5.52 
5.04 
3.86 



5.59 
4.76 
4.39 
3.26 



4.60 
4.05 
3.65 

4.41 
3.85 



4.26 



4.13 
3.29 



Area 



Left hemisphere 
STG/STS 



PrCG 

Cerebellum 
Right hemisphere 
STG/STS 



Cerebellum 
Cerebellum 
Left hemisphere 
STG 



Right hemisphere 
STG/STS 



STG/STS 

Cerebellum 
Left hemisphere 
STG/STS 

Cerebellum 
Right hemisphere 
STG/STS 
STG/STS 
STG/STS 



Size(fc) 



X, y, z (mm) 



2380 

52 
55 

1830 

148 
27 

245 

295 

92 
134 

106 

20 

61 
50 
21 



-64, -29, 3 

-54, -39, 3 
-66, -39, 12 
-52, -5, 51 
-26, -62, -55 

60, -17,-3 
46, -36, 2 
62, -0, -10 

34, -63, -58 
30, -57 -27 

-57 -21, -3 



62, 2, -9 
57 -11, -9 
58, -5, -1 
54, -24, 0 
26, -74, -60 

-66, -26, 2 

-58, -18, 6 
-24, -66, -60 

62, -0,-12 
58, -17-4 
69, -36, 3 



Control vs. LTLE Left hemisphere 

STS 25 -46, -39, 6 



processing after unilateral temporal lobe damage. Further studies 
wiU be necessary to clarify this issue. 

One novel finding is the main effect of melodies in the cere- 
bellum in all groups (without group differences). Since activity 
in the cerebellum has been frequently reported in other studies 
using sung material (Parsons, 2001; Callan et al., 2007; Lebrun- 
Guillaud f-t nl. onoS; Tillmann et al., 2008; Merrill et al, 2012), 
these effects may be linked to optimization of the fine sensory 
acquisition and internalization of input-output characteristics of 
stimuli, a process related to the creation of internal models of 
vocal articulation (Parsons, 2001; CaUan et al., 2007; Stoodley and 
Schmahinanii, 2009), that may fimction independently from the 
hippocampus. 

As previously reported (Sammler et al., 2010), healthy partic- 
ipants presented maximum integration of lyrics and melodies in 
the posterior STS with a continuous decay of the lyrics-melodies 
integration along the posterior-anterior axis, toward regions of 
independent processing of lyrics in the anterior STG. These effects 



z 



5.12 

4.75 
4.68 
3.94 
3.75 

5.85 
4.26 
4.12 
3.91 
3.59 

4.68 



3.69 
3.67 
3.44 
3.69 
3.67 

3.55 

3.24 
3.30 

3.95 
3.65 
3.41 

Left hemisphere 

3.60 



were shown bilaterally in the present experiment, expanding the 
previously reported effect, which was restricted to the left hemi- 
sphere. This analysis illustrates a "gradient of integration" from 
more to less integrated processing. In line with the literature 
on music and language (Scott et al., 2000; Davis and John- 
srude, 2003; Scott and Johnsrude, 2003; Friederici, 2011; Gow, 
2012), this gradient poses an integrative processing of songs at 
the prelexical and phonemic level in the mid-STS. Consequently, 
information can be transmitted both along an anterior pathway 
to the temporal pole for an independent analysis of the lin- 
guistic content, and along a posterior pathway to the left PMC 
for the integrated sensori-motor conversion of the stimuli. In 
other words, lyrics and melodies might split up in the ventral 
pathway for semantics and comprehension (Griffiths, 2001; Pat- 
terson et al., 2002; Hickok and Poeppel, 2007; Sai- "^8; 
Friederici, 2009, 2011; Hickok et al., 2011) but stay integrated in 
sensori-motor dorsal pathways (Kiebel et al., 2008; Loui et al., 
2009). 
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Table 4 | Integration and independence for each group and between 
controls and LTLE. 



INTEGRATION 


Group 


Area 


Size [k] X, y, z (mm) 


Z 



Control Left hemisphere 










STG 


164 


-63, 


-23, 2 


4.39 


MTG 


5 


-63, 


-36, 3 


3.18 


Right hemisphere 










STG 


235 


56, 


-32, 0 


3.98 






51, 


-38, 5 


3.65 


STG 


82 


66, 


-11, -3 


3.76 


STG 


6 


66, 


-20, 0 


3.18 


Control vs. LTLE Right hemisphere 










STG 


6 


63, 


-9, 0 


3.19 


LYRICS INDEPENDENCE 


Control Left hemisphere 










STG 


15 


-63, 


-5, -6 


3.29 


STG 


8 


-46, 


-41, 8 


3.32 


Right hemisphere 










STG 


196 


63, 


-6, -15 


3.79 


MELODY INDEPENDENCE 


LTLE Left hemisphere 










Cerebellum 


10 


-18, 


-66, -61 


3.58 


RTLE Left hemisphere 










Cerebellum 


19 


-12, 


-83, -46 


3.77 


Cerebellum 


6 


-24, 


-66, -61 


3.21 


Right hemisphere 










Cerebellum 


18 


14, 


-81, -46 


3.62 



Taken together, adaptation to lyrics and integration of lyrics 
and melodies within songs appear to be less efficient in patients 
with left hippocampal damage as compared to healthy controls. 
We propose that these lesions may hinder the build-up of a sen- 
sory memory trace for lyrics (with rather preserved mechanisms 
for melodies), which in turn might be at the origin of the reduced 
integration of lyrics and melody. These combined effects could 
be attributed to hippocampal malfunction per se or to a more 
global disconnection of lateral temporal neocortical structures 
caused by repetitive seizures or epilepsy history (Yasuda et al., 20 10; 
Besson et al, 2012), both of which can disrupt the hippocampal 
top-down modulatory influence on STG/STS. If this is the case, 
it is possible that adaptation could also be reduced for stimuli 
other than lyrics, melodies, or songs, demonstrating a more gen- 
eral adaptation and putative encoding deficit following disruption 
of cortico-hippocampal processing loops. 

Interestingly, an independent analysis of the connectivity pro- 
files in our patients showed asymmetries between the left and 
right hemispheric lesion groups: LTLE patients exhibited more 
extended and more strongly left-lateralized disconnections, as 
opposed to more discrete and bilateral connectivity deficits in 
RTLE (Besson et al., 2012). Such differences in connectivity pro- 
files provide an additional explanation for the nominally stronger 
impairments in patients with left hippocampal sclerosis as com- 
pared to patients with right hippocampal sclerosis. In sum, the 
present data indicate that an imbalance in the left hippocampo- 
cortical system, due to hippocampal sclerosis and/or disrupted 
connectivity with STG/STS, affects the incidental emergence of 
a memory trace of verbal song components and precludes the 
build-up of a conjunctive representation that integrates lyrics and 
melodies. 



Contrary to healthy participants, both patient groups showed 
very weak levels of lyrics-melody integration in the bilateral 
mid-STG/STS, and only after lowering the statistical threshold 
to p < 0.05 (uncorrected). This effect may reside on generally 
weaker adaptation effects in both patient groups. The spatial extent 
of this weak lyrics-melody interaction was particularly small in 
patients with left hippocampal sclerosis who also showed a signif- 
icantly reduced interaction effect in the right STG as compared to 
controls. These tendencies suggest a partial (although not com- 
plete) disruption of integrated processing in clinical populations 
and indicate that the conjunctive representation of lyrics and 
melodies depends on intact medial temporal lobe structures, par- 
ticularly in the left hemisphere. Overall, this finding is in line with 
previous studies in patients with anterior temporal lobe resec- 
tion including parts of the hippocampus (Samson and Zatorre, 
1991). These experiments showed a perturbed integration of ver- 
bal and melodic song components in patients with left (but not 
right) temporal lobe resections, i.e., a selective deficit in recog- 
nizing lyrics that was independent from recognition memory for 
melodies. It is worth to mention that in both the present and 
previous studies, the integration deficit may reside on a more 
general deficit to process lyrics, as supported by the weaker adap- 
tation for lyrics and reduced performance in neuropsychological 
tests on verbal memory in our patients with left hippocampal 
sclerosis. 



CONCLUSION 

To the best of our knowledge, this is the first study to investigate 
the processing of songs using fMRI in patients with unilateral hip- 
pocampal sclerosis. We showed that the adaptation to lyrics and 
the integration of lyrics and melodies was diminished in lateral 
temporal lobe regions in patients with left hippocampal sclero- 
sis while a similar but non-significant result pattern was found 
in patients with right hippocampal sclerosis. These findings sug- 
gest the importance of hippocampal top-down modulations on 
the STG/STS during repetitive exposure to songs. We interpret 
the observed adaptation patterns to be a result of a disturbed 
connectivity in a hippocampal-cortical network, weakening the 
emergence of a memory trace for lyrics and the integrated pro- 
cessing of songs as a unified percept. Overall, these data provide 
a novel contribution by suggesting that the integration shown in 
healthy participants is tied to the integrity of the medial temporal 
lobe and its connections with the lateral temporal cortex. 
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